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Abstract 

We provide a complete work-flow, based on the language of quantum information theory, 
suitable for processing data for the purpose of pattern recognition. The main advantage of 
the introduced scheme is that it can be easily implemented and applied to process real-world 
data using modest computation resources. At the same time it can be used to investigate 
the difference in the pattern recognition resulting from the utilization of the tensor product 
structure of the space of quantum states. We illustrate this difference by providing a simple 
example based on the classification of 2D data. 
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1 Introduction 

Quantum machine learning aims at merging the methods from quantum information processing 
and computer engineering to provide new solutions for problems in the areas of pattern recog¬ 
nition and image understanding PlIIlllo]. From one side the research in this area is focused 
at applying the methods of quantum information processing for solving problems related to 
classification and clustering in signal processing. From the other perspective the methods for 
classification developed in computer engineering are used to find solutions for problems like 
quantum state discrimination mm a [5], which are tightly connected with the recent develop¬ 
ments in quantum cryptography. 

The aim of the paper is to present a complete work-flow, based on the mathematical for¬ 
malism used in quantum information theory, suitable for processing data for the purpose of 
pattern recognition. The main goal is to introduce the method for describing classical data 
using entangled quantum states and to investigate the difference between the entangled and the 
separable representations of states. The presented work-flow enable the processing of real-world 
data as it can be implemented and executed on standard desktop computers. The main reason 
why this is possible is the introduction of the quantization procedure which allows the reduction 
of the dimensionality of the utilized quantum states. 

This paper is organized as follows. In Section we provide some notation and initial consid¬ 
erations related to the problem of using quantum states for the purpose of pattern representation 
and classification. In Section we propose a general framework for encoding data using the 
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formalism of quantum states and describe the basic classification of the ID data using this rep¬ 
resentation. In Section we introduce a formalism for constructing representations of feature 
vectors and provide some examples of using entangled and separable flavors of the introduced 
formalism in 2D case. Finally, in Section we provide a summary of the presented work and 
provide concluding remarks. 


2 Preliminaries 

We start by introducing the notation from the area of quantum information relevant for the pur¬ 
pose of processing patterns. We also review the problem of pattern classification and introduce 
the notation used in the paper. 


2.1 Kets, tensors and features 

In this paper we use the notation of quantum mechanics in which vectors are written using ket 
symbols, such that 

l«i> (1) 

is a base vector with label oi. In the simplest case of two-dimensional space, which can be used 
to represent a bit on a quantum computer, we have 


| 0 ) = 


and |1) = 


Accordingly, for A^-dimensional space we have 




-2^^ position, 


\ 6 / 


( 2 ) 


(3) 


with i G {0,1,..., — 1}. We also assume that ket vectors are normalized. 

This notation can be used to express any combination of vectors as 


a = \a) = 




an). 


(4) 


Usually we require that Yli = 1- 

Let us now assume that we deal with p features, which are measured in the experiment. 
With each feature we connect a vector space Fi of dimension Ki , necessary to store the possible 
measurement results. 

A pattern consisting of p features can be represented as a tensor product of feature spaces 
as 

(5) 

where (8) denotes tensor (a.k.a. Kronecker) product [7j. For two vectors |a) G Fi and \h) G F 2 

and 1 ^) ~ ( ■ 1 
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tensor product is defined as 


|a) (g) \b) = 


/ a\bi ^ 


(7) 


\ “iTj bK2 ) 

and it is customary to write |a) (g) |6) = |a, 6) = \ab). 

One of the features of the above formalism is the seamless notation of the scalar product as 


a-b= {a\b) = Y^aibj{i\\j). (8) 

hi 

Elements {i\\j) = dij can be written as {i\j) and they represent scalar products of the base 
vectors and (o| represents the vector dual to |o). 


2.2 Pattern recognition 

Let us now assume that we are dealing with the problem of the supervised learning. In this 
situation we have at our disposal a number of representatives associated with labels denoting 
classes. The classes used to discriminate are constructed using a number of measurement results, 
each one representing a pattern of a known class Pi, 

Ci = (4*^ : fc = 1,.. (9) 

where number of representatives n* can be different for each class. The set of class elements 
with the associated labels is refereed to as the learning set. 

The classification of the unknown patter x is performed using the score function g{x,i), 
calculated for the pattern and each class. In the simplest scenario, the score function is cal¬ 
culated as a distance between the unknown pattern and the class representatives. The class 
(i) 

representatives pli' for each class z = I, 2,..., C are built by the appropriate averaging over the 
class elements. 

The unknown pattern is classified according to the greatest value of the score q^'^\x), calcu- 

(i) 

lated as the distance between the pattern x and the class representatives , i = 1,2,... ,C, 

g«(x) = T>(pW,x). (10) 


3 Pattern representation in quantum formalism 

In this section we develop a quantum representation for patterns and classes. We focus on the 
patterns distributed with the normal distribution, but the provided framework does not depend 
on this assnmption. We start with the simplest case of ID data and subseqnently we consider the 
possible generalizations of the introduced formalism for the purpose of discrimination patterns 
described by vectors of features. 

3.1 General framework 

In order to define a quantum classification procedure one needs to specify the mapping between 
the space of valnes and the space of quantum states. This is exactly the counterpart of the 
quantization function used in machine learning algorithms. However, in this case, this function 
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is used not only to reduce the dimensionality of the problem, but also to obtain a quantum 
representation of patterns and classes. 

Here we aim at representing data in the form of vectors — pure quantum states — or corre¬ 
sponding projection operators. In principle one can extend the presented formalism to include 
mixed quantum states, but the particular choice of pure states enables the straightforward 
utilization of quantum superposition and quantum entanglement. 


3.1.1 Distance-based methods 

From the quantum information point of view, the most natural formulation of the pattern 
classification problem is based on the distance minimization. 

In distance-based methods we are interested in the distance between the class representative 
and the pattern we wish to classify [2] . In order to use quantum states for the purpose of pattern 
classification we use a framework based on the following assumptions: 

(1) Patterns are represented as normalized vectors. 

(2) Classes are represented by the normalized superposition of the representatives. 

(3) Similarity between the patterns is calculated as a scalar product. 

The first assumption from the above list requires mapping the classical data on the quantum 
pure states, 

x^\f{x)). (11) 

In the following the procedure executing this step will be called the quantum quantization 
and its goal is twofold. Firstly, it is required to reduce the feature space and generate the 
quantization symbols. Secondly, it calculates the dimensions of the state space used and maps 
the quantization symbols onto this space. The quantum quantization step is crucial from the 
point of view of analyzing the quantum correlations as it effectively defines the structure of the 
used state space. 

For a given quantum quantization procedure and a selected distance measure one obtains a 
complete procedure for quantum-based classification. In order to classify an unknown pattern 
X one has to execute the following steps. 

Step 1 : Perform quantum quantization by mapping class elements 

4*^ ^ i/(4*4)- 


(i) 

Step 2: Prepare the class representatives for each class i = 1, 2,..., C by the appropriate 


averaging over the quantum class elements. 

3: Calculaf 
1,2,...,C, 


(i) 

Step 3: Calculate the distance between the pattern x and the class representatives , i = 


q^^\x) = D{p^^\x). 


Step 4: Classify the pattern according to the greatest value of q^'^\x). 

ii) 

One should note that the class representatives p)i , i = 1, 2,..., (7 can be obtained as a flat 
superposition of the learning set elements. We will use this method in the next part of the 
paper. 
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3.1.2 Nearest neighbors method 

The quantum states obtained using the quantum quantization procedure can be used in any 
scheme for pattern recognition. For example, in the case of k-NN scheme, one needs to perform 
the following steps. 

Step 1 : Perform quantum quantization by mapping class elements 

cf ^ \f{cf )). 


and the pattern for each class i = 1, 2,..., C. 


Step 2 : Calculate the distances gj {x) = D(|/(cj )),x) between the class representatives p\i 


(i) 


Step 3: Classify the pattern according to the voting using values of q^j\x) uilizing k values. 


However, in this case the procedure is reduced to the classical one as it does not utilize the 
superposition of the base vectors. For this reason we restrict our attention to the distance-base 
methods. 


3.2 ID case with flat subspaces 

In the following we focus on the features represented as numerical values. 

We start by considering the ID feature vectors and introducing the quantum quantization 
procedure. In this case features (and class elements) are represented using flat representation 
in the subspace of the appropriate quantum state space. In this context flat means that pat¬ 
terns are represented by first transforming them into equally distributed intervals and next by 
representing the intervals as quantum state vectors. 

In orders to fix the dimension of the quantum state space required to encode the quantization 
symbols, we will use the data from the learning set. We also require that for the fixed state 
space of dimension d, vectors |0) and |d — 1) will encode all elements smaller and all elements 
larger than the elements in the learning set. For this reason we required that in the case of two 
classes 

d = |[max(C'i U C 2 )] — [min(C'i U C 2 )] \ + 2. (12) 

Here [c] is the nearest integer function, sometimes denoted as nint(c), defined as 

-Dg“W[|cn, if{c}<l/2 

if{c}>l/2 ’ 

where {c} denotes the integer part of c, {c} = | [|c|J — |c|| This function is implemented as rint 
function in math. h C standard library and as Round function in Mathematica system. 

For a given number c we construct its quantum representation as vector 

9fs(c) = |[c] - {min(C'i U (^ 2 )} -M), (14) 

in d-dimensional space of pure quantum states. 

The above procedure can be implemented in Mathematica system as 

qMap[x_, nsl_ , ns2_] := Block [{qMin, qMax, qDim} , 
qMin = Round [Min [Join [ nsl , ns2 ] ] ] ; 
qMax = Round [Max [ Join [nsl, ns2]]]-|-l; 
qDim = Abs [ qMax — qMin ]+ 2 ; 

Which [ 


(13) 
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X < qMin , Ket [ 0 , qDim ] , 

X > qMax, Ket [ qDim — 1, qDim ] , 

True, Ket [Round [x] — qMin + 1, qDim] 


Using the above quantization we define for each class its representation as 

where \ jli) = represents an unnormalized superposition of the elements representing 

the class. 

Finally we introduce quantum flat subspace (QFS) classifier using the score function 

(16) 

for i = 1, 2,..., n representing classes Ci, C 2 ,..., Cn- Using this we classify pattern c into class 
i iff 

> sr*(c) (17) 

for all i ^ j. 

Example 1 Let us consider two samples of real numbers Ci and C 2 distributed with 1) 

and Af{2, 1) respectively. 

For the purpose of classification we represent each class as a normalized superposition of its 
representatives. For example if Ci is represented by 

{-2.24697, -1.17115, -0.882941, -1.9828} (18) 

and C 2 by 

(0.836746,1.70144, 3.05605, -0.0344292} (19) 

we get quantum representations for both classes as 

lm) = ^(|0) + |l) + 2|2)) (20) 

and 

lm> = l(|3> + |4> + |5) + |6» (21) 

respectively for Ci and C 2 . Both j^i) and \p 2 ) o,re normalized and each component has an 
amplitude representing a relative frequency of the appropriate element in the learning set. 

One should note that we need to choose the dimension of the subspace encoding the integer 
part to be large enough for encoding all representatives. For this reason the representation 
will require higher dimensional state space if the learning set contains more elements. In this 
example the observations are encoded in the space of dimension d = 7. 

This classification gives the following output 

{(-4,1), (-3,1), (-2,1), (-1,1), (0, 2), (1, 2), (2, 2), (3, 2), (4,1)} (22) 

where each pair represents a number and the assigned class. 

Note that number 4 has been classified as belonging to class 1 due to the lack of the represen¬ 
tative corresponding to this number in both classes. For this reason gi^^{4:) = g^^^{4) = 0 and 
the classifier cannot decide about the result solely on the data included in the quantum state. 
To avoid the inconclusive results we assign the pattern to the arbitrarily chosen class. 
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4 Higher dimensional generalizations 


In this section we provide methods for using the quantum encoded patterns consisting of vectors 
of features. We introduce two basic schemes for encoding patterns and classes - separable and 
non-separable. Next, we provide some examples of using these schemes. We also propose 
a hybrid method which only partially exploits the tensor product structure of multipartite 
quantum states. 

4.1 Quantum representations of feature vectors 

Let us use the learning set illustrated in Fig as an initial example. In this case the elements 
of the class are represented as quantum states |00) and |1I). We can use this representation 
and construct the representation of the class as the vector ^(|00) + |1I)). In this situation we 
obtain the state which is entangled. However, we can alternatively construct the representation 
of the class in Fig. [^component- wise and in this case we obtain a pair of vectors resulting from 
the quantum averaging of the components, namely (;^(|0) + |1))> ^(|0) + |1)))- This can be 
translated into the representation in the tensor product space as |(|0) + |1) + |2) + |3)). 



1 



|2> 




ID 


• 


|0> 

• 




|0> 

ID 

|2> 


Figure 1: Learning set with two elements from the same class and the possible quantum quan¬ 
tization assigned to each coordinate. 


4.2 Separable pattern representation 

The most straightforward extension of the ID example to the case of patterns represented by 
vectors is obtained by using the ID classifier independently from each component. In this model 
the features are encoded into corresponding quantum states and the classification is performed 
component-wise. As a result one obtains a vector of results, which indicates the classification 
of the pattern. 

Following the example presented above, we introduce the separable data representation as 
follows. 


Definition 1 (Separable data representation) For a vector of features x = (xi,X 2 , ■ ■ ■, Xn) 
the quantum counterpart obtained using the separable data representation is constructed as n- 
tuple of quantum representations of the components, ie. 


x) 


( hi> 

\^ 2 ) 

\ hn> 



(23) 
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Accordingly, one can introduce the quantum averaging used to represent classes of features 
as an averaging over the components. This can be expressed as follows. 


Definition 2 (Separable class representation) For a class consisting of n-dimensional fea¬ 


tures vector, Ci = 
tained as 


, the quantum separable representation of the class is ob- 




/ T.U \ 


(24) 


)/ 


ii) 

where cyi denotes the l-th element of the j-th feature vector in the class Ci 


In the simplest case of 2D feature space we have 

|(^^l,^^2)) = (j^2>) 

and the class representations are constructed as 

^ ( E?=i H,f) \ 

^ Kd ) ■ 

One should note that in the introduced scheme the Dirac notation is overloaded and has 
to be interpreted according to the type of its argument. Moreover, the quantum quantization 
procedure required to build the representations of components has to operate on the quantum 
state space of dimension appropriate for each feature. 




4.3 Non-separable pattern representation 

One of the main differences between the classical and quantum mechanics is the structure of 
the space used to describe the composite system. In the classical case such space is constructed 
using the Cartesian product of the subspaces describing the subsystem, whilst in the quantum 
case the composite systems are described using the Kronecker product. 

In quantum information theory the main phenomenon arising from the use of the tensor 
product is the entanglement. As the entanglement can be seen as a form of strongly non-classical 
correlation, we expect that the usage of the tensor product structure allows us encoding and 
exploiting the correlation between the features. For this reason we introduce the non-separable 
scheme as follows. 

Definition 3 (Non-separable data representation) For a vector of features x = (xi, X 2 ,..., Xn) 
the quantum counterpart obtained using the non-separable data representation is constructed as 
a tensor product of quantum representations of the components, ie. 

|x) = |xi) (g) |X2) (8) • • • (8) |Xn). (27) 

In the simplest case of 2D feature space we have 

|(ui,U2)) = |ui) (8) |U2). (28) 

Definition 4 (Non-separable data representation) For a class consisting of n-dimensional 
features vector, Ci = the quantum non-separable representation of the class 

is obtained as 
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(29) 



The main difference between the above formula and the formula in Eq. (24) is that it maps 
feature vectors onto vector space of dimension di x d 2 x ■ ■ ■ x dn, where di, i = 1,2,... ,n, are 
dimensions of spaces required to encode features. The separable data representation requires 
only the space of dimension Yli di- 

In the case of 2D feature space the above encoding can be implemented in Mathematica as 


qMuEnt2d [ k_ , ns_ ] := Block[{ml, m2, v}, 

ml = qMap[# , ns [ 1 , 1] , ns [2 , 1 ] ] & /@ ns [k , 1 ]; 
m2 = qMap[# , ns [ 1 , 2] , ns [2 , 2 ] ] & /@ ns [k , 2]; 

V = Plus @@ Flatten [ Table [x 0 y, {x, ml}, {y, m2}], Ij; 

V /Norm [ v ] 


where function qMap was defined in Section 3^ 


4.4 Example: pattern recognition in 2D 


The main advantage of the scheme introduced in this section is that it can be easily implemented 
and executed for the real-world data. In this section we present a simple example illustrating 
the difference between the separable and the non-separable representation of data. 

In order to execute the classification using the representations introduced above, one has 
to specify the score function used in the final step. In the case of the non-separable data 
representation we use the natural choice, namely the overlap between the class representation 
and the quantum state representing the pattern, 

g^{x,i) = \{fif\x)\. (30) 


In the case of the separable data representation we will use the score function defined as 


g^ix,i) = 





(31) 


where denotes the k-th. component of the separable quantum representation defined in 

Eq. (@. ’ 

The above definitions allow the comparison of the efficiency of the pattern classification. 
For this purpose we have used two 2D Gaussian sources producing the pairs described as 
(A/(1,1),A/(1,1)) and (A/(-2,1), A/(-2,1)). 


For the purpose of the presented experiment we have generated a sample of 100 learning sets, 
each consisting of n elements of both classes, with n = 2,3,...,16. For each learning set we have 
used two samples of 1000 points generated using the distribution describing the classes. Each of 
the generated points has been classified using both separable and non-separable quantum data 
representation. The test has been assessed as positive if the point had been classified to the 
class corresponding to the appropriate distribution. 

The results of the described experiment are presented in Fig. As one can observe the 
utilization of the non-separable data representation allows better classification of the test data. 
This is expected as the utilization of the tensor product enables the operations on the larger 
subspace for the purpose of encoding the quantization symbols. 


5 Concluding remarks and further work 

In the presented work we have introduced the complete quantum-inspired scheme enabling the 
processing of real-world data for the purpose of pattern classification. The introduced repre¬ 
sentations of data are based on the quantization procedure which reduces the dimensionality of 
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Figure 2: Success rate of the proper classification as the function of the number of elements 
in the learning set for the separable (black solid line) and non-separable (red dashed line) 
representation of data. 


the quantum state space required to process the data. Thanks to this one can implement and 
execute the introduced scheme on a standard desktop computer. We have also used the intro¬ 
duced framework to conduct a simple experiment where it is possible to observe the difference 
between the separable and non-separable representation of data. 

One should note that the presented considerations are based on the formalism of quantum 
mechanics, but do not require a working quantum computer. The utilization of a quantum 
machine would be beneficial from the security point of view as in this case the information 
about the class representatives is unknown to the party performing the classification. 

The introduced framework is based on a quantization procedure and the introduction of 
the linear combinations of quantization symbols. As such it can be easily adopted for other 
problems in machine learning. In particular we plan to apply it to the problem of clustering. 

From the quantum information point of view, the introduced framework can be generalized 
by using a formalism of mixed states in the quantization procedure. One of the advantages of 
such approach is the possibility of using other distance measures, based on the generalization 
of distance measures used in classical information theory. Moreover, the use of density matrices 
can be used for incorporating the information about uncertainty in the measurement results. 
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